Chinese AI Model Trained on Synthetic Data Outperforms Larger Models Using Nvidia Chips

Published: 2026-01-26 12:34:02
BTCCSquare news:

New research demonstrates the potential of synthetic data for training AI models. Researchers from Tsinghua University, Microsoft Research Asia, and Wuhan University developed a 7-billion-parameter AI model called X-Coder, trained entirely on artificially generated data using Nvidia's H20 and H200 chips. The model outperformed coding models twice its size that were trained on human-generated data.

The team leveraged Nvidia's export-controlled hardware, using 128 H20 chips for 220 hours of supervised fine-tuning, followed by 32 H200 chips for seven days of reinforcement learning. This hardware selection, which combined inference-optimized H20 and training-focused H200 processors, proved critical given current US export restrictions.
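To put those figures in perspective, the reported chip counts and durations can be converted into rough chip-hours. The sketch below is a back-of-the-envelope calculation only: it assumes every chip ran for the full stated duration and treats "seven days" as 168 hours, neither of which the article confirms. The helper name `chip_hours` is illustrative.

```python
# Back-of-the-envelope chip-hours from the figures reported in the article.
# Assumption (not from the source): every chip ran for the full duration.

HOURS_PER_DAY = 24


def chip_hours(num_chips: int, hours: float) -> float:
    """Return total chip-hours for num_chips each running for `hours`."""
    return num_chips * hours


# Supervised fine-tuning: 128 H20 chips for 220 hours.
sft_h20_hours = chip_hours(128, 220)

# Reinforcement learning: 32 H200 chips for seven days (taken as 168 hours).
rl_h200_hours = chip_hours(32, 7 * HOURS_PER_DAY)

print(f"SFT on H20: {sft_h20_hours:,.0f} chip-hours")
print(f"RL on H200: {rl_h200_hours:,.0f} chip-hours")
```

Under these assumptions the fine-tuning stage comes to 28,160 H20 chip-hours and the reinforcement learning stage to 5,376 H200 chip-hours. The two totals are not directly comparable, since the chips differ in capability.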

Findings published on arXiv reveal synthetic data follows established scaling laws, challenging conventional wisdom about training data requirements. The SynthSmith pipeline's success suggests compute power, not data sourcing, may become the primary constraint in AI development.

